Picture for Shuai Wang

Shuai Wang

The Hong Kong University of Science and Technology

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Add code
May 29, 2026
Viaarxiv icon

A Unified and Reproducible Experimentation Framework for Speech Understanding

Add code
May 29, 2026
Viaarxiv icon

Parameter-Efficient Subspace Decoupling ViT for Mitigating Multi-Task Negative Transfer in Histological Scoring

Add code
May 28, 2026
Viaarxiv icon

Can It Reach the Generator? Investigating the Survival of Prompt-Injection Attacks in Realistic RAG Settings

Add code
May 28, 2026
Viaarxiv icon

Audio-Mind: An Auditable Agentic Framework for Audio Understanding

Add code
May 27, 2026
Viaarxiv icon

Workflow Closure Is Not Scientific Closure in Auto-Research Systems

Add code
May 25, 2026
Viaarxiv icon

Route Before Retrieve: Activating Latent Routing Abilities of LLMs for RAG vs. Long-Context Selection

Add code
May 11, 2026
Viaarxiv icon

Ground4D: Spatially-Grounded Feedforward 4D Reconstruction for Unstructured Off-Road Scenes

Add code
May 06, 2026
Viaarxiv icon

Full-Duplex Interaction in Spoken Dialogue Systems: A Comprehensive Study from the ICASSP 2026 HumDial Challenge

Add code
Apr 23, 2026
Viaarxiv icon

A-MAR: Agent-based Multimodal Art Retrieval for Fine-Grained Artwork Understanding

Add code
Apr 21, 2026
Viaarxiv icon